1 CCCGGGCGAG GACCCCTCCA GGATGCAGGT CCCGAACAGC ACCGGCCCGG 
51 ACAACGCGAC GCTGCAGATG CTGCGGAACC CGGCGATCGC GGTGGCCCTG 
101 CCCGTGGTGT ACTCGCTGGT GGCGGCGGTC AGCATCCCGG GCAACCTCTT 
151 CTCTCTGTGG GTGCTGTGCC GGCGCATGGG GCCCAGATCC CCGTCGGTCA 
201 TCTTCATGAT CAACCTGAGC GTCACGGACC TGATGCTGGC CAGCGTGTTG 
251 CCTTTCCAAA TCTACTACCA TTGCAACCGC CACCACTGGG TATTCGGGGT 
301 GCTGCTTTGC AACGTGGTGA CCGTGGCCTT TTACGCAAAC ATGTATTCCA 
351 GCATCCTCAC CATGACCTGT ATCAGCGTGG AGCGCTTCCT GGGGGTCCTG 
401 TACCCGCTCA GCTCCAAGCG CTGGCGCCGC CGTCGTTACG CGGTGGCCGC 
451 GTGTGCAGGG ACCTGGCTGC TGCTCCTGAC CGCCGTGTCC CCGCTGGCGC 
501 GCACCGATCT CACCTACCCG GTGCACGCCC TGGGCATCAT CACCTGCTTC 
551 GACGTCCTCA AGTGGACGAT GCTCCCCAGC GTGGCCATGT GGGCCGTGTT 
601 CCTCTTCACC ATCTTCATCC TGCTGTTCCT CATCCCGTTC GTGATCACCG 
651 TGGCTTGTTA CACGGCCACC ATCCTCAAGC TGTTGCGCAC GGAGGAGGCG 
701 CACGGCCGGG AGCAGCGGAG GCGCGCGGTG GGCCTGGCCG CGGTGGTCTT 
751 GCTGGCCTTT GTCACCTGCT TCGCCCCCAA CAACTTCGTG CTCCTGGCGC 
801 ACATCGTGAG CCGCCTGTTC TACGGCAAGA GCTACTACCA CGTGTACAAG 
851 CTCACGCTGT GTCTCAGCTG CCTCAACAAC TGTCTGGACC CGTTTGTTTA 
901 TTACTTTGCG TCCCGGGAAT TCCAGCTGCG CCTGCGGGAA TATTTGGGCT 
951 GCCGCCGGGT GCCCAGAGAC ACCCTGGACA CGCGCCGCGA GAGCCTCTTC 
1001 TCCGCCAGGA CCACGTCCGT GCGCTCCGAG GCCGGTGCGC ACCCTGAAGG 
1051 GATGGAGGGA GCCACCAGGC CCGGCCTCCA GAGGCAGGAG AGTGTGTTCT 
1101 GAGTCCCGGG GGCGCAGCTT GGAGAGCCGG GGGCGCAGCT TGGAGATCCA 
1151 GGGGCGCATG GAGAGGCCAC GGTGCCAGAG GTTCAGGGAG AA 
FIG. 2. Predicted polypeptide sequence of HG52. 



1 MQVPNSTGPD NATLQMLRNP AIAVALPVVY SLVAAVSIPG NLFSLWVLCR 
51 RMGPRSPSVI FMINLSVTDL MLASVLPFQI YYHCNRHHWV FGVLLCNVVT 
101 VAFYANMYSS ILTMTCISVE RFLGVLYPLS SKRWRRRRYA VAACAGTWLL 
151 LLTALSPLAR TDLTYPVHAL GIITCFDVLK WTMLPSVAMW AVFLFTIFIL 
201 LFLIPFVITV ACYTATILKL LRTEEAHGRE QRRRAVGLAA VVLLAFVTCF 
251 APNNFVLLAH IVSRLFYGKS YYHVYKLTLC LSCLNNCLDP FVYYFASREF 
301 QLRLREYLGC RRVPRDTLDT RRESLFSART TSVRSEAGAH PEGMEGATRP 
10 30 50 

CCGGGCGAGGACCCCTCCAGGATGCAGGTCCCGAACAGCACCGGCCCGGACAACGCGACG 

MetGlnValProAsnSerThrGlyProAspAsnAlaThr 

70 90 110 

CTGCAGATGCTGCGGAACCCGGCGATCGCGGTGGCCCTGCCCGTGGTGTACTCGCTGGTG 
LeuGlnMetLeuArgAsnProAlaIleAlaValAlaLeuProValValTyrSerLeuVal 

130 150 170 

GCGGCGGTCAGCATCCCGGGCAACCTCTTCTCTCTGTGGGTGCTGTGCCGGCGCATGGGG 
AlaAlaValSerIleProGlyAsnLeuPheSerLeuTrpValLeuCysArgArgMetGly 

190 210 230 

CCCAGATCCCCGTCGGTCATCTTCATGATCAACCTGAGCGTCACGGACCTGATGCTGGCC 
ProArgSerProSerValIlePheMetIleAsnLeuSerValThrAspLeuMetLeuAla 

250 270 290 

AGCGTGTTGCCTTTCCAAATCTACTACCATTGCAACCGCCACCACTGGGTATTCGGGGTG 
SerValLeuProPheGlnIleTyrTyrHisCysAsnArgHisHisTrpValPheGlyVal 

310 330 350 

CTGCTTTGCAACGTGGTGACCGTGGCCTTTTACGCAAACATGTATTCCAGCATCCTCACC 
LeuLeuCysAsnValValThrValAlaPheTyrAlaAsnMetTyrSerSerIleLeuThr 

370 390 410 

ATGACCTGTATCAGCGTGGAGCGCTTCCTGGGGGTCCTGTACCCGCTCAGCTCCAAGCGC 
MetThrCysIleSerValGluArgPheLeuGlyValLeuTyrProLeuSerSerLysArg 

430 450 470 

TGGCGCCGCCGTCGTTACGCGGTGGCCGCGTGTGCAGGGACCTGGCTGCTGCTCCTGACC 
TrpArgArgArgArgTyrAlaValAlaAlaCysAlaGlyThrTrpLeuLeuLeuLeuThr 

490 510 530 

GCCCTGTCCCCGCTGGCGCGCACCGATCTCACCTACCCGGTGCACGCCCTGGGCATCATC 
AlaLeuSerProLeuAlaArgThrAspLeuThrTyrProValHisAlaLeuGlyIleIle 

550 570 590 

ACCTGCTTCGACGTCCTCAAGTGGACGATGCTCCCCAGCGTGGCCATGTGGGCCGTGTTC 
ThrCysPheAspValLeuLysTrpThrMetLeuProSerValAlaMetTrpAlaValPhe 
610 630 650 

CTCTTCACCATCTTCATCCTGCTGTTCCTCATCCCGTTCGTGATCACCGTGGCTTGTTAC 
LeuPheThrIlePheIleLeuLeuPheLeuIleProPheValIleThrValAlaCysTyr 

670 690 710 

ACGGCCACCATCCTCAAGCTGTTGCGCACGGAGGAGGCGCACGGCCGGGAGCAGCGGAGG 
ThrAlaThrIleLeuLysLeuLeuArgThrGluGluAlaHisGlyArgGluGlnArgArg 

730 750 770 

CGCGCGGTGGGCCTGGCCGCGGTGGTCTTGCTGGCCTTTGTCACCTGCTTCGCCCCCAAC 
ArgAlaValGlyLeuAlaAlaValValLeuLeuAlaPheValThrCysPheAlaProAsn 

790 810 830 

AACTTCGTGCTCCTGGCGCACATCGTGAGCCGCCTGTTCTACGGCAAGAGCTACTACCAC 
AsnPheValLeuLeuAlaHisIleValSerArgLeuPheTyrGlyLysSerTyrTyrHis 

850 870 890 

GTGTACAAGCTCACGCTGTGTCTCAGCTGCCTCAACAACTGTCTGGACCCGTTTGTTTAT 
ValTyrLysLeuThrLeuCysLeuSerCysLeuAsnAsnCysLeuAspProPheValTyr 

910 930 950 

TACTTTGCGTCCCGGGAATTCCAGCTGCGCCTGCGGGAATATTTGGGCTGCCGCCGGGTG 
TyrPheAlaSerArgGluPheGlnLeuArgLeuArgGluTyrLeuGlyCysArgArgVal 

970 990 1010 

CCCAGAGACACCCTGGACACGCGCCGCGAGAGCCTCTTCTCCGCCAGGACCACGTCCGTG 
ProArgAspThrLeuAspThrArgArgGluSerLeuPheSerAlaArgThrThrSerVal 

1030 1050 1070 

CGCTCCGAGGCCGGTGCGCACCCTGAAGGGATGGAGGGAGCCACCAGGCCCGGCCTCCAG 
ArgSerGluAlaGlyAlaHisProGluGlyMetGluGlyAlaThrArgProGlyLeuGln 

1090 1110 1130 

AGGCAGGAGAGTGTGTTCTGAGTCCCGGGGGCGCAGCTTGGAGAGCCGGGGGCGCAGCTT 
ArgGlnGluSerValPhe 

1150 1170 1190 

GGAGATCCAGGGGCGCATGGAGAGGCCACGGTGCCAGAGGTTCAGGGAGAA 
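
The deduced amino acids printed under each nucleotide row can be checked by translating the open reading frame codon by codon. The following is a minimal sketch, assuming the standard genetic code and that the reading frame begins at the first ATG; the function name `translate_orf` and the variable `first_row` are illustrative and are not part of the original figures. Input containing characters other than A, C, G, and T would raise a `KeyError`.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate_orf(dna: str) -> str:
    """Translate from the first ATG up to the first in-frame stop codon.

    Assumption: the first ATG in the input is the start codon.
    Spaces (as used in the 10-base groups of the listing) are ignored.
    """
    dna = dna.upper().replace(" ", "")
    start = dna.find("ATG")
    if start < 0:
        return ""
    protein = []
    # Step through complete codons only.
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nucleotide row of the translated listing above (positions 1 to 60).
first_row = "CCGGGCGAGGACCCCTCCAGGATGCAGGTCCCGAACAGCACCGGCCCGGACAACGCGACG"
print(translate_orf(first_row))
```

Run on the first row, this yields the one-letter form of the residues printed beneath it (Met Gln Val Pro Asn Ser Thr Gly Pro Asp Asn Ala Thr). Concatenating all rows before calling the function would translate the full reading frame.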
10 20 30 

HG52.pep MQVPNSTGPDNATLQMLRNPAIAVALPVVYS 

|:| : I : :| : ::: :| ||: 
hTHRR EPFWEDEEKNESGLTEYRLVSINKSSPLQKQLPAFISEDASG--YLTSSWLTLFVPSVYT 
60 70 80 90 100 110 



40 50 60 70 80 90 

HG52.pep LVAAVSIPGNLFSLWVLCRRMGPRSPSVIFMINLSVTDLMLASVLPFQIYYHCNRHHWVF 

I MM I:::: \ ■ ■ I : : I : I : : I : : I : : : I : : : : I M I I : | |: : | I 
hTHRR GVFVVSLPLNIMAIVVFILKMKVKKPAVVYMLHLATADVLFVSVLPFKISYYFSGSDWQF 

120 130 140 150 160 170 



100 110 120 130 140 150 

HG52.pep GVLLCNVVTVAFYANMYSSILTMTCISVERFLGVLYPLSSKRWRRRRYAVAACAGTWLLL 

I II Ihlll I IN! I II II I M 

hTHRR GSELCRFVTAAFYCNMYASILLMTVISIDRFLAVVYPMQSLSWRTLGRASFTCLAIWALA 

180 190 200 210 220 230 



160 170 180 190 200 210 

HG52.pep LTALSPLARTDLTYPVHALGIITCFDVLKWTMLPSVAMWAVFLFTIF-ILLFLIPFVITV 



hTHRR IAGVVPLVLKEQTIQVPGLNITTCHDVLNETLLEG---YYAYYFSAFSAVFFFVPLIIST 

240 250 260 270 280 



220 230 240 250 260 

HG52.pep ACYTATILKLLRTEEAHGREQRRRAVGLAAVVLLAFVTCFAPNNFVLLAH--IVSRLFYG 



hTHRR VCYV-SIIRCLSSSAVANRSKKSRALFLSAAVFCIFIICFGPTNVLLIAHYSFLSHTSTT 
290 300 310 320 330 340 



270 280 290 300 310 320 

HG52.pep KSYYHVYKLTLCLSCLNNCLDPFVYYFASREFQLRLREYLGCRRVPRDTLDTRRESLFSA 

:: | :| | :|:| : : : | : | | : : | | : | | | | : | |:: 
hTHRR EAAYFAYLLCVCVSSISSCIDPLIYYYASSECQRYVYSILCCKESSDPSSYNSSGQLMAS 
350 360 370 380 390 400 

330 340 350 360 

HG52.pep RTTSVRSEAGAHPEGMEGATRPGLQRQESVF 

hTHRR KMDTCSSNLNNSIYKKLLT 
410 420 
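
The degree of similarity between two aligned rows can be summarized as a percent identity. The sketch below is illustrative only: it assumes gaps are written as `-`, and it counts identical residues over the columns in which neither row has a gap, which is one of several conventions in use. The function name `percent_identity` and the variable names are hypothetical and do not come from the original figures.

```python
def percent_identity(row_a: str, row_b: str) -> float:
    """Percent of identical residues over columns without a gap ('-').

    Assumption: both rows come from the same alignment block and
    therefore have the same number of columns.
    """
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have equal length")
    # Keep only the columns where both sequences have a residue.
    columns = [(a, b) for a, b in zip(row_a, row_b) if a != "-" and b != "-"]
    if not columns:
        return 0.0
    matches = sum(a == b for a, b in columns)
    return 100.0 * matches / len(columns)


# One 60-column block of the alignment above (HG52 residues 263 to 322).
hg52_block = "KSYYHVYKLTLCLSCLNNCLDPFVYYFASREFQLRLREYLGCRRVPRDTLDTRRESLFSA"
hthrr_block = "EAAYFAYLLCVCVSSISSCIDPLIYYYASSECQRYVYSILCCKESSDPSSYNSSGQLMAS"
print(f"{percent_identity(hg52_block, hthrr_block):.1f}% identity in this block")
```

Applying the function to the concatenated rows of every block, rather than to a single block, would give a figure for the alignment as a whole.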